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3 <110> APPLICANT: MEIJI SEIKA KAISHA, LTD. 

4 WATANABE, Manabu 

5 YANAI, Koji 

6 TSUYUKI , Yumiko 

8 <120> TITLE OF INVENTION: Surfactant tolerant cellulase and method for 
modification thereto 

10 <130> FILE REFERENCE: Q95278 
C--> 12 <140> CURRENT APPLICATION NUMBER: US/10/582,002 
C--> 12 <141> CURRENT FILING DATE: 2006-06-07 

12 <150> PRIOR APPLICATION NUMBER: PCT/JP2 004/018184 

13 <151> PRIOR FILING DATE: 2004-12-07 

15 <150> PRIOR APPLICATION NUMBER: JP 2003-409692 

16 <151> PRIOR FILING DATE: 2003-12-08 
18 <160> NUMBER OF SEQ ID NOS : 48 

20 <170> SOFTWARE: Patentln version 3.1 

22 <210> SEQ ID NO: 1 

23 <211> LENGTH: 870 

24 <212> TYPE: DNA 

25 <213> ORGANISM: Humicola insolens MN200-1 
27 <220> FEATURE: 

2 8 <221> NAME/KEY: CDS 

29 <222> LOCATION: (1) . . (870) 

31 <220> FEATURE: 

32 <221> NAME /KEY : misc_feature 

33 <222> LOCATION: (1)..(3) 

34 <223> OTHER INFORMATION: Pyroglutamic acid 

37 <220> FEATURE: 

38 <221> NAME/KEY: source 

39 <222> LOCATION: (16).. (870) 

40 <223> OTHER INFORMATION: Humicola insolens MN200-1 
43 <400> SEQUENCE: 1 



44 


cag 


aac 


tgt 


gga 


tec 


get gat 


ggc 


aag 


tec 


acc 


cgc 


tac 


tgg gac tgc 


48 


45 


Gin 


Asn 


Cys 


Gly 


Ser 


Ala Asp 


Gly 


Lys 


Ser 


Thr 


Arg 


Tyr 


Trp Asp Cys 




46 


1 








5 








10 








15 




48 


tgc 


aag 


cct 


teg 


tgc 


ggc tgg 


gee 


aag 


aag 


get 


ccc 


gtg 


aac cag cct 


96 


49 


Cys 


Lys 


Pro 


Ser 


Cys 


Gly Trp 


Ala 


Lys 


Lys 


Ala 


Pro 


Val 


Asn Gin Pro 




50 








20 








25 










30 




52 


gtc 


ttc 


tec 


tgc 


aac 


gee aac 


ttc 


cag 


cgt 


etc 


act 


gac 


ttc gac gee 


144 


53 


Val 


Phe 


Ser 


Cys 


Asn 


Ala Asn 


Phe 


Gin 


Arg 


Leu 


Thr 


Asp 


Phe Asp Ala 




54 






35 








40 










45 






56 


aag 


tec 


ggc 


tgc 


gag 


ccg ggc 


ggt 


gtc 


gee 


tac 


teg 


tgc 


gee gac cag 


192 


57 


Lys 


Ser 


Gly 


Cys 


Glu 


Pro Gly 


Gly 


val 


Ala 


Tyr 


Ser 


Cys 


Ala Asp Gin 




58 




50 








55 










60 








60 


acc 


cca 


tgg 


get 


gtg 


aac gac 


gac 


ttc 


gcg 


ttc 


ggt 


ttt 


get gee acc 


240 
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61 


Thr 


Pro 


Trp 


Ala 


Val 


Asn 


Asp Asp 


Phe 


Ala 


Phe 


Gly 


Phe 


Ala 


Ala 


Thr 




62 


65 










70 










75 










80 




64 


tct 


att 


gee 


ggc 


age 


aat 


gag 


gcg 


ggc 


tgg 


tgc 


tgc 


gee 


tgc 


tac 


gag 


288 


65 


Ser 


He 


Ala 


Gly 


Ser 


Asn 


Glu 


Ala 


Gly 


Trp 


Cys 


Cys 


Ala 


Cys 


Tyr 


Glu 




66 










85 










90 










95 






68 


etc 


ace 


ttc 


aca 


tec 


ggt 


cct 


gtt 


get 


ggc 


aag 


aag 


atg 


gtc 


gtc 


cag 


336 


69 


Leu 


Thr 


Phe 


Thr 


Ser 


Gly 


Pro 


Val 


Ala 


Gly 


Lys 


Lys 


Met 


Val 


Val 


Gin 




70 








100 










105 










110 








72 


tec 


acc 


age 


act 


ggc 


ggt 


gat 


ctt 


ggc 


age 


aac 


cac 


ttc 


gat 


etc 


aac 


384 


73 


Ser 


Thr 


Ser 


Thr 


Gly 


Gly 


Asp 


Leu 


Gly 


Ser 


Asn 


His 


Phe 


Asp 


Leu 


Asn 




74 






115 










120 










125 










76 


ate 


ccc 


ggc 


ggc 


ggc 


gtc 


ggc 


ate 


ttc 


gac 


gga 


tgc 


act 


ccc 


cag 


ttc 


432 


77 


He 


Pro 


Gly 


Gly 


Gly 


Val 


Gly He 


Phe 


Asp 


Gly 


Cys 


Thr 


Pro 


Gin 


Phe 




78 




130 










135 










140 












80 


ggc 


ggt 


ctg 


ccc 


ggc 


cag 


cgc 


tac 


ggc 


ggc 


ate 


teg 


tec 


cgc 


aac 


gag 


480 


81 


Gly 


Gly 


Leu 


Pro 


Gly 


Gin 


Arg 


Tyr 


Gly 


Gly 


He 


Ser 


Ser 


Arg 


Asn 


Glu 




82 


145 










150 










155 










160 




84 


tgc 


gat 


egg 


ttc 


ccc 


gac 


gee 


etc 


aag 


ccc 


ggc 


tgc 


tac 


tgg 


cgc 


ttc 


528 


85 


Cys 


Asp 


Arg 


Phe 


Pro 


Asp 


Ala 


Leu 


Lys 


Pro 


Gly 


Cys 


Tyr 


Trp 


Arg 


Phe 




86 










165 










170 










175 






88 


gac 


tgg 


ttc 


aag 


aac 


gee 


gac 


aac 


ccg 


age 


ttc 


age 


ttc 


cgt 


cag 


gtc 


576 


89 


Asp 


Trp 


Phe 


Lys 


Asn 


Ala 


Asp Asn 


Pro 


Ser 


Phe 


Ser 


Phe 


Arg 


Gin 


Val 




90 








180 










185 










190 








92 


caa 


tgc 


cca 


gee 


gag 


etc 


gtc 


get 


cgc 


acc 


gga 


tgc 


cgc 


cgc 


aac 


gac 


624 


93 


Gin 


Cys 


Pro 


Ala 


Glu 


Leu 


Val 


Ala 


Arg 


Thr 


Gly 


Cys 


Arg 


Arg 


Asn 


Asp 




94 






195 










200 










205 










96 


gac 


ggc 


aac 


ttc 


cct 


gee 


gtc 


cag 


ate 


ccc 


tec 


age 


age 


acc 


age 


tct 


672 


97 


Asp 


Gly 


Asn 


Phe 


Pro 


Ala 


Val 


Gin 


He 


Pro 


Ser 


Ser 


Ser 


Thr 


Ser 


Ser 




98 




210 










215 










220 













100 


ccg 


gtc 


ggc 


cag 


cct 


acc 


agt 


acc 


age acc acc tec 


acc 


tec 


acc 


acc 


720 


101 


Pro 


Val 


Gly 


Gin 


Pro 


Thr 


Ser 


Thr 


Ser Thr Thr Ser 


Thr 


Ser 


Thr 


Thr 




102 


225 










230 






235 








240 




104 


teg 


age 


ccg 


ccc 


gtc 


cag 


cct 


acg 


act ccc age ggc 


tgc 


act 


get 


gag 


768 


105 


Ser 


Ser 


Pro 


Pro 


Val 


Gin 


Pro 


Thr 


Thr Pro Ser Gly 


Cys 


Thr 


Ala 


Glu 




106 










245 








250 






255 






108 


agg 


tgg 


get 


cag 


tgc 


ggc 


ggc 


aat 


ggc tgg age ggc 


tgc 


acc 


acc 


tgc 


816 


109 


Arg 


Trp 


Ala 


Gin 


Cys 


Gly Gly Asn 


Gly Trp Ser Gly 


Cys 


Thr 


Thr 


Cys 




110 








260 










265 




270 








112 


gtc 


get 


ggc 


age 


acc 


tgc 


acg 


aag 


att aat gac tgg 


tac 


cat 


cag 


tgc 


864 


113 


Val 


Ala 


Gly 


Ser 


Thr 


Cys 


Thr 


Lys 


He Asn Asp Trp 


Tyr 


His 


Gin 


Cys 




114 






275 










280 




285 











116 ctg taa 870 

117 Leu 

121 <210> SEQ ID NO: 2 

122 <211> LENGTH: 289 

123 <212> TYPE: PRT 

124 <213> ORGANISM: Humicola insolens MN200-1 

126 <22 0> FEATURE: 

127 <221> NAME/KEY: misc feature 
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128 <222> LOCATION: (1) . . (3) 

129 <223> OTHER INFORMATION: Pyroglutamic acid 



131 


<400> SEQUENCE: 


2 










133 


Gin 


Asn 


Cys 


Gly 


Ser 


Ala Asp Gly Lys Ser 


Thr Arg Tyr 


Trp Asp Cys 


134 


1 








5 


10 






15 


137 


Cys 


Lys 


Pro 


Ser 


Cys 


Gly Trp Ala Lys Lys 


Ala Pro 


Val 


Asn Gin Pro 


138 








20 




25 






30 


141 


Val 


Phe 


Ser 


Cys 


Asn 


Ala Asn Phe Gin Arg 


Leu Thr Asp 


Phe Asp Ala 


142 






35 






40 




45 




145 


Lys 


Ser 


Gly 


Cys 


Glu 


Pro Gly Gly Val Ala 


Tyr Ser 


Cys 


Ala Asp Gin 


146 




50 








55 


60 






149 


Thr 


Pro 


Trp 


Ala 


Val 


Asn Asp Asp Phe Ala 


Phe Gly Phe 


Ala Ala Thr 


150 


65 










70 


75 




80 


153 


Ser 


He 


Ala 


Gly 


Ser 


Asn Glu Ala Gly Trp 


Cys Cys Ala 


Cys Tyr Glu 


154 










85 


90 






95 


157 


Leu 


Thr 


Phe 


Thr 


Ser 


Gly Pro Val Ala Gly 


Lys Lys 


Met 


Val Val Gin 


158 








100 




105 






110 


161 


Ser 


Thr 


Ser 


Thr 


Gly 


Gly Asp Leu Gly Ser 


Asn His 


Phe 


Asp Leu Asn 


162 






115 






120 




125 




165 


He 


Pro 


Gly 


Gly 


Gly 


Val Gly He Phe Asp 


Gly Cys 


Thr 


Pro Gin Phe 


166 




130 








135 


140 






169 


Gly 


Gly 


Leu 


Pro 


Gly 


Gin Arg Tyr Gly Gly 


He Ser 


Ser 


Arg Asn Glu 


170 


145 










150 


155 




160 


173 


Cys 


Asp 


Arg 


Phe 


Pro 


Asp Ala Leu Lys Pro 


Gly Cys 


Tyr 


Trp Arg Phe 


174 










165 


170 






175 


177 


Asp 


Trp 


Phe 


Lys 


Asn 


Ala Asp Asn Pro Ser 


Phe Ser 


Phe 


Arg Gin Val 


178 








180 




185 






190 


181 


Gin 


Cys 


Pro 


Ala 


Glu 


Leu Val Ala Arg Thr 


Gly Cys Arg 


Arg Asn Asp 


182 






195 






200 




205 




185 


Asp 


Gly 


Asn 


Phe 


Pro 


Ala Val Gin He Pro 


Ser Ser 


Ser 


Thr Ser Ser 


186 




210 








215 


220 






189 


Pro 


Val 


Gly 


Gin 


Pro 


Thr Ser Thr Ser Thr 


Thr Ser 


Thr 


Ser Thr Thr 


190 


225 










230 


235 




240 


193 


Ser 


Ser 


Pro 


Pro 


Val 


Gin Pro Thr Thr Pro 


Ser Gly Cys 


Thr Ala Glu 


194 










245 


250 






255 


197 


Arg 


Trp 


Ala 


Gin 


Cys 


Gly Gly Asn Gly Trp 


Ser Gly Cys 


Thr Thr Cys 


198 








260 




265 






270 


201 


Val 


Ala 


Gly 


Ser 


Thr 


Cys Thr Lys He Asn 


Asp Trp 


Tyr 


His Gin Cys 


202 






275 






280 




285 





205 Leu 

209 <210> SEQ ID NO: 3 

210 <211> LENGTH: 897 

211 <212> TYPE: DNA 

212 <213> ORGANISM: Staphylotrichum coccosporum IFO 31817 

215 <220> FEATURE: 

216 <221> NAME/KEY: CDS 

217 <222> LOCATION: (1)..(897) 

219 <220> FEATURE: 

220 <221> NAME / KEY : misc feature 
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221 <222> LOCATION: (1) . . (3) 

222 <223> OTHER INFORMATION: Pyroglutamic acid 

225 <220> FEATURE: 

226 <221> NAME/ KEY : source 

227 <222> LOCATION: (13):. (897) 

228 <223> OTHER INFORMATION: Staphylotrichum coccosporum IFO 31817 

231 <400> SEQUENCE: 3 

232 cag teg gca tgc gat ggc aag tec acc cgc tac tgg gac tgc tgc aag 48 

233 Gin Ser Ala Cys Asp Gly Lys Ser Thr Arg Tyr Trp Asp Cys Cys Lys 

234 15 10 15 

236 cct teg tgc teg tgg ccc ggc aag gee teg gtg aac cag ccc gtc ttc 96 

237 Pro Ser Cys Ser Trp Pro Gly Lys Ala Ser Val Asn Gin Pro Val Phe 

238 20 25 30 

240 gec tgc age gec aac ttc cag cgc ate age gac ccc aac gtc aag teg 144 

241 Ala Cys Ser Ala Asn Phe Gin Arg lie Ser Asp Pro Asn Val Lys Ser 

242 35 40 45 

244 ggc tgc gac ggc ggc tec gee tac gee tgc gee gac cag acc ccg tgg 192 

245 Gly Cys Asp Gly Gly Ser Ala Tyr Ala Cys Ala Asp Gin Thr Pro Trp 

246 50 55 60 

248 gee gtc aac gac aac ttc teg tac ggc ttc gee gee acg tec ate teg 240 

249 Ala Val Asn Asp Asn Phe Ser Tyr Gly Phe Ala Ala Thr Ser lie Ser 

250 65 70 75 80 

252 ggc ggc aac gag gee teg tgg tgc tgt ggc tgc tac gag ctg acc ttc 288 

253 Gly Gly Asn Glu Ala Ser Trp Cys Cys Gly Cys Tyr Glu Leu Thr Phe 

254 85 90 95 

256 acc teg ggc ccc gtc get ggc aag acc atg gtt gtc cag tec acc teg 336 

257 Thr Ser Gly Pro Val Ala Gly Lys Thr Met Val Val Gin Ser Thr Ser 

258 100 105 110 

260 acc ggc ggc gac etc ggc acc aac cac ttc gac ctg gee atg ccc ggt 384 

261 Thr Gly Gly Asp Leu Gly Thr Asn His Phe Asp Leu Ala Met Pro Gly 

262 115 120 125 

264 ggt ggt gtc ggc ate ttc gac ggc tgc teg ccc cag ttc ggc ggc etc 432 

265 Gly Gly Val Gly lie Phe Asp Gly Cys Ser Pro Gin Phe Gly Gly Leu 

266 130 135 140 

268 gee ggc gac cgc tac ggc ggc gtc teg teg cgc age cag tgc gac teg 480 

269 Ala Gly Asp Arg Tyr Gly Gly Val Ser Ser Arg Ser Gin Cys Asp Ser 

270 145 150 155 160 

272 ttc ccc gee gee etc aag ccc ggc tgc tac tgg cgc ttc gac tgg ttc 528 

273 Phe Pro Ala Ala Leu Lys Pro Gly Cys Tyr Trp Arg Phe Asp Trp Phe 

274 165 170 175 

276' aag aac gee gac aac ccg acc ttc acc ttc cgc cag gtc cag tgc ccg 576 
2 77 Lys Asn Ala Asp Asn Pro Thr Phe Thr Phe Arg Gin Val Gin Cys Pro 
278 180 185 190 

280 teg gag etc gtc gee cgc acc ggc tgc cgc cgc aac gac gac ggc aac. 624 

281 Ser Glu Leu Val Ala Arg Thr Gly Cys Arg Arg Asn Asp Asp Gly Asn 

282 195 200 205 

284 ttc ccc gtc ttc acc cct ccc teg ggc ggt cag tec tec teg tct tec 672 

285 Phe Pro Val Phe Thr Pro Pro Ser Gly Gly Gin Ser Ser Ser Ser Ser 

286 210 215 220 
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288 


tec 


tec 


age 


age 


gee 


aag ccc 


acc 


tec 


acc 


tec 


acc 


teg 


acc 


acc tec 


289 


Ser 


Ser 


Ser 


Ser 


Ala 


Lys Pro 


Thr 


Ser 


Thr 


Ser 


Thr 


Ser 


Thr 


Thr Ser 


290 


225 










230 








235 








240 


292 


acc 


aag 


get 


acc 


tec 


acc acc 


teg 


acc 


gee 


tec 


age 


cag 


acc 


teg teg 


293 


Thr 


Lys 


Ala 


Thr 


Ser 


Thr Thr 


Ser 


Thr 


Ala 


Ser 


Ser 


Gin 


Thr 


Ser Ser 


294 










245 








250 










255 


296 


tec 


acc 


ggc 


ggc 


ggc 


tgc gee 


gee 


cag 


cgc 


tgg 


gcg 


cag 


tgc 


ggc ggc 


297 


Ser 


Thr 


Gly 


Gly 


Gly 


Cys Ala 


Ala 


Gin 


Arg 


Trp 


Ala 


Gin 


Cys Gly Gly 


298 








260 








265 










270 




300 


ate 


ggg 


ttc 


teg 


ggc 


tgc acc 


acg 


tgc 


gtc 


age 


ggc 


acc 


acc 


tgc aac 


301 


He 


Gly 


Phe 


Ser 


Gly 


Cys Thr 


Thr 


Cys 


Val 


Ser 


Gly 


Thr 


Thr 


Cys Asn 


302 






275 








280 










285 






304 


aag 


cag 


aac 


gac 


tgg 


tac teg 


cag 


tgc 


ctt 


taa 










305 


Lys 


Gin 


Asn 


Asp 


Trp 


Tyr Ser 


Gin 


Cys 


Leu 












306 




290 








295 


















309 


<210> SEQ ID NO; 


: 4 




















310 


<211> LENGTH: 298 




















311 


<212> TYPE: 


PRT 






















312 


<213> ORGANISM: 


Staphylotri chum 


coccosporum 


IFO 


31817 




314 


<220> FEATURE: 






















315 


<221> NAME /KEY : 


misc feature 
















316 


<222> LOCATION: 


(1) 


. . (3) 


















317 


<223> OTHER 


INFORMATION: Pyroglutamic acid 










319 


<400> SEQUENCE: 


4 




















321 


Gin 


Ser 


Ala 


Cys 


Asp 


Gly Lys 


Ser 


Thr 


Arg 


Tyr 


Trp 


Asp 


Cys 


Cys Lys 


322 


1 








5 








10 










15 


325 


Pro 


Ser 


Cys 


Ser 


Trp 


Pro Gly 


Lys 


Ala 


Ser 


Val 


Asn 


Gin 


Pro 


Val Phe 


326 








20 








25 










30 




329 


Ala 


Cys 


Ser 


Ala 


Asn 


Phe Gin 


Arg 


He 


Ser 


Asp 


Pro 


Asn 


Val 


Lys Ser 


330 






35 








40 










45 






333 


Gly 


Cys 


Asp 


Gly 


Gly 


Ser Ala 


Tyr 


Ala 


Cys 


Ala 


Asp 


Gin 


Thr 


Pro Trp 


334 




50 








55 










60 








337 


Ala 


Val 


Asn 


Asp 


Asn 


Phe Ser 


Tyr 


Gly 


Phe 


Ala 


Ala 


Thr 


Ser 


He Ser 


338 


65 










70 








75 








80 


341 


Gly 


Gly 


Asn 


Glu 


Ala 


Ser Trp 


Cys 


Cys 


Gly 


Cys 


Tyr 


Glu 


Leu 


Thr Phe 


342 










85 








90 










95 


345 


Thr 


Ser 


Gly 


Pro 


Val 


Ala Gly 


Lys 


Thr 


Met 


Val 


Val 


Gin 


Ser 


Thr Ser 


346 








100 








105 










110 




349 


Thr 


Gly 


Gly 


Asp 


Leu 


Gly Thr 


Asn 


His 


Phe 


Asp 


Leu 


Ala 


Met 


Pro Gly 


350 






115 








120 










125 






353 


Gly 


Gly 


Val 


Gly 


He 


Phe Asp 


Gly 


Cys 


Ser 


Pro 


Gin 


Phe 


Gly Gly Leu 


354 




130 








135 










140 








357 


Ala 


Gly 


Asp 


Arg 


Tyr 


Gly Gly 


Val 


Ser 


Ser 


Arg 


Ser 


Gin 


Cys 


Asp Ser 


358 


145 










150 








155 








160 


361 


Phe 


Pro 


Ala 


Ala 


Leu 


Lys Pro 


Gly 


Cys 


Tyr 


Trp 


Arg 


Phe 


Asp 


Trp Phe 


362 










165 








170 










175 


365 


Lys 


Asn 


Ala 


Asp 


Asn 


Pro Thr 


Phe 


Thr 


Phe 


Arg 


Gin 


Val 


Gin Cys Pro 


366 








180 








185 










190 




369 


Ser 


Glu 


Leu 


Val 


Ala 


Arg Thr 


Gly 


Cys 


Arg 


Arg 


Asn 


Asp 


Asp 


Gly Asn 



720 



768 



816 



864 



897 
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